SMALLER POPULATION SIZE AT THE MRCA TIME FOR 
STATIONARY BRANCHING PROCESSES 
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Abstract. We present an elementary model of random size varying population given by a 
stationary continuous state branching process. For this model we compute the joint distri- 
bution of: the time to the most recent common ancestor, the size of the current population 
and the size of the population just before the most recent common ancestor (MRCA). In 
particular we show a natural mild bottleneck effect as the size of the population just before 
the MRCA is stochastically smaller than the size of the current population. We also compute 
the number of old families which corresponds to the number of individuals involved in the 
last coalescent event of the genealogical tree. By studying more precisely the genealogical 
structure of the population, we get asymptotics for the number of ancestors just before the 
current time. We give explicit computations in the case of the quadratic branching mech- 
anism. In this case, the size of the population at the MRCA is, in mean, less by 1/3 than 
size of the current population size. We also provide in this case the fluctuations for the 
renormalized number of ancestors. 



1. Introduction 

A large literature is devoted to constant size population models. It goes back to Wright 
|47| (1930) and Fisher |21j (1931) in discrete time, and Moran [39J (1958) in continuous 
time. Models for constant infinite population in continuous time with spatial motion were 
introduced by Fleming and Viot [22] (1979). On the other hand, the study of the genealogical 
tree of constant size population was initiated by Kingman [29] (1982), and described in a more 
general setting by Pitman [33] (1999) and Sagitov [36] (1999). The complete description of 
the genealogy of the Fleming- Viot process can be partially done using the historical super- 
process by Dawson and Perkins [13] (1991) and precisely by using the look-down process 
developed by Donnelly and Kurtz [144 115] (1999) or the stochastic flows from Bertoin and Le 
Gall PEnHII] (2003). 

It is however natural to consider random size varying population models. Branching popu- 
lation models, for which sizes of the population are random, goes back to Galton and Watson 
|23| (1873) in discrete time and with finite mass individual. Jirina [2B] (1958) considered 
continuous state branching process (CB) models corresponding to individuals with infinitesi- 
mal mass. The genealogy of those processes can be partially described through the historical 
super-process. However the continuum Levy tree introduced by Le Gall and Le Jan [M] 
(1998) and developed later by Duquesne and Le Gall [H] (2002) allows to give a complete 
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description of the genealogy in the critical and sub-critical cases. See the approach of Abra- 
ham and Delmas [1] (2008) or Berestycki, Kyprianou and Murillo [?J (2009) for a description 
of the genealogy in the super-critical cases. 

The two families of models: constant size population and branching populations are, in 
certain cases, linked. The case of a quadratic branching corresponds to the fact that only 
two genealogical lines of the population genealogical tree can merge together. In this par- 
ticular case, it is possible to establish links between the constant size population model and 
CB models. Thus, conditionally on having a constant population size, the Dawson- Watanabe 
super-process is a Fleming- Viot process, see Etheridge-March |18j (1991). On the other hand, 
using a time change (with speed proportional to the inverse of the population size) , it is pos- 
sible to recover a Fleming- Viot process from a Dawson- Watanabe super-process, see Perkins 
|41j (1992). Birkner, Blath, Capaldo, Etheridge, Mohle, Schweinsberg and Wakolbinger [12] 
(2005) have given similar results for stable branching mechanism. In the same spirit, Kaj 
and Krone [27] (2003) studied the genealogical structure of models of random size varying 
population models and recover the Kingman coalescent with a random time change. 

Recently, some authors studied the coalescent process (or genealogical tree) of random size 
varying population, in this direction see Mohle [38J (2002), Lambert [30J (2003) for branching 
process and Jagers and Sagitov [25] (2004) for stationary random size varying population. 

Our primary interest is to present an elementary model of random size varying population 
and exhibit some interesting property which could not be observed in constant size model. 
The most striking example is the natural mild bottleneck effect: in a stationary regime, the 
size of the population just before the most recent common ancestor (MRCA) is stochastically 
smaller than the current population size. Our second goal is to give some properties of the 
coalescent tree such as: time to the most recent common ancestor (TMRCA), number of 
individuals involved in the last coalescent event, asymptotic behavior of the number of recent 
ancestors. 

One of the major drawback of the branching population models is that either the population 
becomes extinct or decreases to 0, which happens with probability 1 in the (sub)critical cases, 
or blows up exponentially fast with positive probability in the super-critical case. In particular 
there is no stationary regime, and the study of the genealogy of a current population depends 
on the arbitrary original size and time of the initial population. To circumvent this problem, 
we consider a sub-critical CB, Y = (Yt,t > 0), with branching mechanism ift given by (pQ). We 
get the Q-process by conditioning Y to non-extinction (which is an event of zero probability), 
see [35] and [31]. The Q-process can also be seen as a CB with immigration, see [25]. We 
take the opportunity to present a probabilistic construction of independent interest for the 
Q-process in Corollary 13.51 which relies on a Williams' decomposition of CB described in [2]. 
A first study of the genealogical tree of the Q-process can be found in [30] . 

We consider the Q-process under its stationary distribution and defined on the real line: 
Z = (Z t ,t £ M). Its Laplace transform, see (|3.6p . is given by 



E e 



= exp I — 




A > 



t G R 



where ^(A) = "0(A) — X^'(0). In order for Zt to be finite, we shall assume condition (|A2|) : 
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In order for the TMRCA to be finite, we assume condition (|A1|) : 



Notice a very similar condition exists to characterize coalescent processes which descent from 
infinity, see [6]. 

As in the look-down representation for constant size population, we shall represent the 
process Z using the picture of an immortal individual which gives birth to independent sub- 
populations or families. For fixed time to = (which we can indeed choose to be equal to 
by stationarity) , we consider A the TMRCA of the population living at time 0, Z A = Z(_p^_ 
the size of the population just before the MRCA, Z 1 the size of the population at time 
which has been generated by the immortal individual over the time interval {—A, 0) and 
Z° = Zq — Z 1 the size of the population at time which has been generated by the immortal 
individual at time —A. In Theorem I4.lt we gi ye 

the joint distribution of {A, Z A , Z 1 , Z°). 

One interesting phenomenon is Corollary 14.31 

Corollary 1. Conditionally on A; Z A , Z 1 and Z° are independent. 

In particular, conditionally on A, Z A and Z are independent. Conditionally on A, Z A 
depends on the past before —A of the process Z and has to die at time 0, Z° corresponds to 
the size of the population at time generated at time — A and Z 1 corresponds to the size of the 
population at time generated by the immortal individual over the time interval (—^4,0). 
Then, as the immortal individual gives birth to independent populations, the Corollary is 
then intuitively clear. 

One of the most striking result, the natural mild bottleneck effect, is stated in Proposi- 
tion H3J 

Proposition 2. Z A is stochastically smaller than Zq. 

Thus just before the MRCA, the population size is unusually small. Notice this result 
is not true in general if one considers the size of the population at the MRCA instead of 
just before, see Remark 14.61 We get nice quantitative results for the quadratic branching 
mechanism case, see Corollary 17.21 

Corollary 3. Assume tp is quadratic (and given by \4^)- We have: a.s. 



Notice that even is Z is stochastically smaller than Z$ it is not a.s. smaller. 

We also give in Theorem 14 . 71 the joint distribution of Zq and the TMRCA of the immortal 
individual and n individuals picked at random in the population at time 0. See also related 
results in [30] . 

We investigate in Proposition 15.21 the joint distribution of A, Zq and N A , where N A + 1 
represents the number of individuals involved in the last coalescent event of the genealogical 
tree. Under a first moment condition on Z, we get that if the TMRCA is large, then the last 
coalescent event is likely to involve only two individuals. In the stable case, this first moment 
condition is not satisfied, and the last coalescent event does not depend on the TMRCA, see 
Remark 15.51 This suggests a result similar to the one obtained in [12]: in the stable case, 




an in particular: 
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the topology of the genealogical tree (which does not take into account the length of the 
branches) may not depend on its depth given by the TMRCA. 

After giving a more precise description of the genealogy of Z using continuum Levy trees, 
we compute in Theorem 16.91 the asymptotic behavior of the number of ancestors at time — s, 
M s , of the population at time 0. 

Theorem 4. The following convergence holds in probability: 

siO C(s) 

f°° dv 

where c(s) is related to the extinction probability ofY and defined by / — — = t. 



c(t) 



This result is very similar to the one obtained on coalescent process in [6j (notice the con- 
vergence is a.s. in [6]). We can precise the fluctuations in the quadratic case, see Theorem l7,8l 

Theorem 5. Assume ip is quadratic (and given by ft4>fy)- We have 

Vc(s)E[Z ] (Ifi- - Zo) (Z - Z' 

\C(S) J si0+ 

where Z Q is distributed as Zq and independent of Zq. 

The paper is organized as follows. We first recall well known facts on CB in Section (2) We 
introduce in Section [3] the corresponding stationary CB, which is related to the Q-process 
of the CB, and give its first properties. We give the joint distribution of (A, Z A , Z 1 , Z°) in 
Section [5] and prove the natural bottleneck effect, that is Z A is stochasitcally smaller than 
Zq. We compute the number of old families (or number of individuals involved in the last 
coalescent event) in Section [5] and the asymptotics of the number of ancestors in Section [6l 
A first consequent part of this latter Section is devoted to the introduction of the genealogy 
of CB processes using continuum random Levy trees. We give more detailed results in the 
quadratic branching setting of Section [7J 

2. Continuous-state branching process (CB) 

We recall some well-known fact on continuous-state branching process (CB), see for ex- 
ample |36j and references therein. We consider a sub-critical branching mechanism ip: for 
A > 0, 



(1) ip{\) =aA + /3A 2 + / Tr(d£) 

where a = ^'(0) > 0, (3 > and ir is a Radon measure on (0, +oo) such that J*( +oo)(^ ^ 
£ 2 ) ir(d£) < +oo. We consider the non trivial case that is either [3 > or vr((0, 1)) = +oo. 
Notice that ip is convex, of class C 1 on [0, +oo) and of class C°° on (0, +oo) and ip"(0+) £ 
(0,+oo]. 

Let P x be the law of a CB Y = (Yt,t > 0) started at mass x > and with branching 
mechanism if), and let be the corresponding expectation. The process Y is a cad-lag Re- 
valued Feller process and is a cemetery point. The process Y has no fixed discontinuities. 
For every A > 0, for every t > 0, we have 

(2) E x \e~ XYt ] = e - xu ^' t \ 
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where the function u is the unique non-negative solution of 



(3) 



u(X,t) + ip(u(X,s))ds = X, A>0, t>0. 



o 



Note that the function u is equivalently characterized as the unique non-negative solution of 

rA ilr 



(4) , 

(A,t) V\r) 

or as the unique non-negative solution of: for A > 0, 



/ 

Ju( 



t A > 0, t > 0. 



(5) 



d t u + ij){u) 
«(A,0) 



t > 0, 
A. 



Markov property of Y implies that for all A, s, t > 0: 

(6) u(u(X,t), s) = u(X,t + s). 

Let N be the canonical measure (we shall also call it excursion measure) associated to 
Y. It is a cr-finite measure which intuitively describe the distribution of Y started at an 
infinitesimal mass. We recall that if 

^2S XitY i(dx,dY) 
is a Poisson point measure with intensity l[o,+oo)( x ) dscN[dY], then 

(7) E 1 ^^}^ 

i&I 

is distributed as Y under P x .. In particular, we have: for A > 

u(X,t). 

For convenience, we shall put Yt = for t < 0. 

Let C = inf{t; Yj = 0} be the extinction time of Y. We consider the function: 



N 


"l_ e -W* 


= lim — E x 


"l-e- Ay *" 






x4.o rc 





(8) 



c(t) = N[C > t] = N[y t > 0] = lim f u(X,t). 



We shall assume throughout this paper, but for Sections 13. 1 1 and [3^31 that the strong extinction 
property holds: 

ro ° dv 



(Al) 



< +OO. 



It follows from and ([8]) that c is the unique non-negative solution of: 

rD0 du 



(9) 



(t) ^(«) 



t, t > 0. 



Thanks to (jAip . we get that c(t) is finite for alH > and N[C = +oo] = 0. We also get that 
c is continuous decreasing and thus one-to-one from (0, +oo) to (0, +oo). Letting A goes to 
infinity in (|6|) yields that for s, t > 



(10) 



u(c(t), s) = c(t + s). 
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3. Stationary CB 

In contrast to Wright-Fisher population models, CB models do not exhibit stationary 
distributions. However, by conditioning sub-critical CB to non-extinction (see [15], [20] and 
[31] for details), one get the so-called Q-process, which we denotes by Y" . This process is also a 
CB process with immigration introduced in [28] and may have a stationary distribution. This 
process, as pointed out in [3] see also [19], has a heuristic interpretation by introducing a fixed 
ancestral lineage, namely it is an independent sum of the process Y and the population thrown 
off by an "immortal individual" whose laws coincide with the law of a generic population Y. 

We introduce the process Y" in Section 13.11 as well as its stationary version Z. Then we 
check in Section 13.21 that under (|Aip the process Y" is indeed the Q-process associated to 
Y. This gives then a natural interpretation of Z. We give preliminary results on the process 
Z in Sections 13.31 and 13.41 

3.1. Poisson point measure of CB. We consider the following Poisson point measures. 

• Let Mo(dr, dt) = X^e/ 3(n,ti)(,dr, dt) be a Poisson point measure on (0, +oo) x E with 
intensity 

r ir(dr)dt. 

• Conditionally on Mo, let (Mi t i,i € I), where M\ t i(dt, dY) = XjeJi i^tj,Yi{dt,dY), be 
independent Poisson point measures with respective intensity 

n6 ti {dt)N[dY}. 

Notice that for all j £ Jn, we have tj = tj. We set J\ = IJie/ Ji % an d M\{dt, dY) = 
>: r h » l: .y«H.<IY;. 

• Let M2{dt,dY) = Ylj£.j 2 ^tjy i (dt,dY) be a Poisson point measure independent of 
(Mo, Mi) and with intensity 

2/3 dt N[dY}. 

We set J = J\ [J J2. We shall call Y 3 , with j £ J a family and tj its birth time. 

We will consider the two following processes Y" = (Y",t > 0) and its stationary version 
Z = (Z t ,t G M): 

(11) Yl'= E Y L P 

j&J,tj>0 

(12) Z t = J2Yl tj - 

We will denote by P the probability under which Y" and Z are defined and E the correspond- 
ing expectation. 

At this stage, let us emphasize there is another natural decomposition of Y" and Z. For 
i E I, set Y l = X^jeJi ^ 3 and X = / (J J2. The random measure 

(13) M 3 (dt,dY) = J2*t i ,Yi(dt,dY) 
is a Poisson point measure with intensity dtfj,(dY) and 



(14) 



fx(dY) = 2/3N[dY] + f £ir(d£) P e (dY). 
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And we have: 
(15) 



(16) 



iel;U>0 



We shall call Y l , with i 6 I a clan and ti its birth time. For j £ J2, Y° is a clan and a 
family. Notice that a.s. two clans have different birth time, but families in the same clan 
have the same birth time. 

The presentation with clans is simpler than the representation with families and most of the 
results can be obtained using the former representation. We will use the family representation 
in Sections [5] and [U 

We define ip by: 

(17) $(A) = V(A) - \i//(Q) = V(A) - a\. 

We first give a Lemma on the family representation. 
Lemma 3.1. Let F be a non-negative measurable function. We have 



;is) 



E 



exp 



dt^(N[l-e F (*' y )]) 



Proof. Using Poisson point measure properties, we get: 



E 



E 
E 



E 



e -2/3/dtN[l-o F ( t . y )] 



Q-Zieinm-my)} 

e -/#/(0,+oQ)M<*9 (l-«p(-<N[l-F(t,Y)])) e -2/3fdtNll-c F ^] 



fdt i/; , (N[l-e F ( t ' y )]) 



□ 



Proposition 3.2. T/ie process Y" is a CB with branching mechanism if) and immigration 
function ip' : 



= 2/3A + 



(0,+oo) 



£-jr(d£) (1 



started at Yq = 0. 



Proof. This is a direct consequence of Lemma 13.11 and results from |28j . 



□ 



In particular Y" is a strong Markov process started at and its transition kernel is char- 
acterized by: for A > 0, t > 0, r > 



E[e 



t IYq" = r] = exp 



-ru(A, t) 



tp'(u(\,s)) ds). 



The next result is then straightforward. 



Corollary 3.3. For each t e I, {Z s ;s > £} /ias the same law as a CB with branching 
mechanism ip and immigration function ip' started at the invariant distribution ¥(Zt G •). 
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3.2. Q-process. We check the process Y" is indeed the Q-process for CB using Williams' 
decomposition. 

Let m > and v m (dt) = Yliel r i$u{dt), where Yliei 3(r i ,t i )(d''', dt) is a Poisson point mea- 
sure with intensity 

l[o,m](*) e ~ rC(m ~ t) r <dr)dt. 
Conditionally on v m , let Ylj^j™- $tj,Yi(dt,dY) be a Poisson point measure with intensity 

(v m (dt) + 2/31 [0H (t) dt) N[dY,( < m - t]. 
The next Proposition is a consequence of Theorem 3.3 in [2]. 

Proposition 3.4. Assume \A1\) holds. Under N, conditionally on {Q = m}, Y is distributed 
as (Y[,t > 0) where 

It is then easy to deduce the following Corollary using representation (|15p of Y". 

Corollary 3.5. Assume \A1\) holds. The limit distribution of Y under N, conditionally on 
{C = m}, as m goes to infinity, is the distribution ofY" from Proposition\ 



Corollary 13.51 readily implies that the Q-process associated to Y, that is the limit distri- 
bution of Y under N, conditionally on {£ > m}, as m goes to infinity, is the distribution of 
Y" from Proposition! 



3.3. Stationary CB. We first give an interpretation of Z in population terms. At time t, 
Zt correspond to the size of a population generated by an immortal individual (with zero 
mass) which gives birth at rate 2(3 to clans (or families) which sizes evolve independently 
as Y under N and at rate 1 with intensity r iv(dr) to clans with initial size r which evolve 
independently as Y under P r . 

By construction the process Z is stationary. The next Lemma which gives the Laplace 
transform of Z is a direct consequence of the construction of Z. 

Lemma 3.6. For all t £8 and A > 0, the Laplace transform of Zt is given by: 
(19) E r e " AZt l = exp ( - / ds i>' '(«(A, a)) 



Proof. Using Lemma l3.lt we have: 



E 



e -AZ t 



exp (- J ds ^'(N[l - e- XYt ' s })^j = exp ^- J ds 4>'(u(\, a)) 



□ 



We shall consider the following assumption 

/+oo 
l]og{£) ir{dt) < +oo. 

The next Lemma is well known (notice condition (lAlj) is not assumed). 

Lemma 3.7. In the sub-critical case, the following conditions are equivalent: 
(i) (ED holds. 

( %% ) Jo (s? " W)) dv < + °°- 

(Hi) E r [lt log(lt)] < +°° f or some t > and r > 0. 
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(iv) K r [Y t log(Y t )] < +00 for all t > and r > 0. 

Proof. For (i) 44> (ii) see [21] proof of Theorem 4a, and for (ii) 4^ (iii) (or (iv)) use Lemma 1 
p.25 of [5]. □ 

The next Proposition gives a condition for finiteness of Z, see also [12] in a more general 
framework. 

Proposition 3.8. We have V(Zq < +00) = 1 if and only if holds. 

Proof. Thanks to (fl~9|h we get P(Zo < +00) = 1 if and only if lim^o / °° ds ij/(u(X, s)) = 0. 
As A I—)- u(X, s) decreases to as A goes down to for all s > 0, we deduce by dominated 
convergence that F(Z < +00) = 1 if and only if J ds ip'(u(\,s)) < +00 for at least one 
A > 0. 

Notice that d t u+ip(u) = implies ip'(u) = —dfu/d t u, and hence for every < t < T < +00 
we have 

We deduce that T H> ip(u(X,T)) e aT is decreasing. We also get that ds ijj'(u(X,s)) < 
+00 if and only if limT-s>+oo ip(u(X,T)) e aT > or equivalently limr->+oo u(X, T) e aT > as 
limT->>+oo u(\, T) = thanks to (|4|). 
We deduce from ((4|) that 

r-A 



(21) 



u(X,T)e aT = Aexp [ a 



1 

ar 



Thus we deduce from Lemma 13.71 that E(Zq < +00) = 1 if and only if (|A2j) holds. 
Corollary 3.9. Assume lA^) holds. We have for A > 0, t £ R: 



□ 



(22) E 
In particular, we have: 
(23) 

Proof. We deduce from f)19|) that: 



2 ( e 



-AZ t 



g(A) 



E 



-xz t 



nZ t ] = ^£(0, + oo]. 



il/ (0) 



E 



Z t e 



-xz t 



E 



-AZ t 



d x 4>'(u(\,s))ds. 



We deduce from dU that A h-> u(A, s) is increasing and of class C°° on (0, +00) and that 

ip(u(X,s)) -d s u(X,s) 



(24) 
Thus, we get: 



d x u(X,s) 



V>(A) 



dx 



/•oo /*oo 

f ip'(u(X, s)) ds = / ip" (u(X, s))dxu(X, s) ds 
Jo 

ip"(u(X, s))d s u(X, s) ds 



1 

V'(A)' 
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The last part of the Corollary is immediate. □ 



Remark 3.10. Assumption (|Aip is not needed to define the process Y" or the stationary 
process Z. However the study of MRCA for Z is not relevant if (|Aip does not hold. 

Notice, we will introduce a complete genealogical structure for Z in Section [6] by using a 
genealogical structure of the families (Y 1 ,j G J). 

From now on, we shall assume that (]A1|) and ()A2|) are in force. 

3.4. Further property for stationary CB. By construction, we deduce that for all 
the process (Z s+ t,s > 0) is a CB with branching mechanism ip and immigration function 
ifj' started as the stationary distribution whose Laplace transform is given by (|19p. Then 
Proposition 1.1 in [28] implies that Z is a Hunt process and in particular it is cad-lag and 
strongly Markov taking values in [0, +oo]. By stationarity and since +oo is a cemetery point 
for Z, we deduce that a.s. for all t G M, Zt is finite. 

Next, we recall some asymptotic properties of the functions u and c given in Lemma 3.1 
of [30]. 

Lemma 3.11. For every A G (0, oo), we have 

(25) i im ^fM = e -«- 1 (A) 

V ' t^oo c(t) 

and there exists G (0, oo) such that 

(26) lim c(t)e' 

t— >oo 



at 



We also compute some integral of tp' . 
Proposition 3.12. The fallowings hold for every < t < oo: 

(27) ri/ WA ,, ))Jl= ,j!«*^), A> „. 

Jt \ K *" J 

(28) /> (c( »)) d » = U«"'' 
Jt \ no- 
where the constant defined in Lemma \3.11\ 

Proof. We deduce from ([20]), ([25]) and ([26]) that: 

lim ^(u(A,T))e^ = lim ^ M / A ff> ^ c{T)^ = a 

and (j27[) follows by letting T — s- oo for both sides of (|20p . Then, let A goes to infinity in 
(j27[) to get (|28p and use the monotone convergence theorem. □ 

As a consequence of (|27p with £ = and Lemma 13.61 we get the following Corollary. 

Corollary 3.13. For all t G M and A > 0, t/ie Laplace transform of Zt is given by: 



(29) E e~ AZi =exp^-^ ds ^'(u(A, s)) ^ 



V>(A) 

Eventually, we check that Z is non-zero. Recall notations from Section 13. 11 Let Q = 
inf{t > 0; 1? = 0} be the duration of the family or clan Y l and ti + Q its extinction time, 
with i in /, J\ or J2. 
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Proposition 3.14. We have 

^ 1 1, 



In particular, we have P(3i 6 R; Zt = 0) = 0. 

For — oo < a < b < +oo, we will consider in the forthcoming proof 

( 30 ) N a,b = E 1 {U<a;b<U+Q}^ 

the number of clans born before a and still alive at time b. Notice N a ^ is a Poisson random 
variable with parameter 



A(& - a) : = J drfi(dY) l ( _ 005a) (r)l {c+r>b} 
dr ij) (c(r)) 



(31) = log 



b— a 

V>(c(6-a))e Q ( 6 - a )' 



where we used (Q3D the definition of /x for the first equality and (|28p for the last equality. 

Proof. Observe that no clan surviving at time t £ (a, b) implies that there are no clan sur- 
viving on any non-degenerate interval containing t. Hence, for any n > 1, we have: 

{1 n ra+l 

3t G (a, b), £ l (tilti+Ci) (t) = C |J {N v ._ uU . = 0} U J {^V^ = 0} 
iez J i=l i=i 

where itj = o + j(b — a)/n and Uj = a + (2j — 1)(6 — a) /2n. Notice that N Uj _ ltUj and N Vj _ X)Vj 
are Poisson random variables with parameter n = A((6 — a)/n). We deduce that 



(32) 



(3te(a,b), X)l(t 4A+W (*) = 0^ <(2n + l)e- ". 



Therefore the first part of the Proposition will be proved as soon as lim n -_>+oo n exp(—9 n ) = 

which, thanks to formula (1311) . will be implied by \im tip (c(t)) = +00 and thus by 

t— >o 

(33) lim / TTT^(A) = +00. 

Hypothesis on /3 and ir imply there exists a constant cq > such that 
aX < ip(X) < cqX 2 and lim ip(X)/X = +00. 

A— >+oo 

Therefore (|33|) is in force. 

The second part of the Proposition is clear by definition of Q and representation (|16p . □ 
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4. TMRCA AND POPULATIONS SIZES 

We consider the coalescence of the genealogy at a fixed time to- Thanks to stationarity, 
we may assume that to = and we write Z instead of Zq. There are infinitely many clans 
contributing to the population at time 0. The Poisson random variable introduced in (|30p . 
with 6 = 0, gives the number of clans born before a and still alive at time 0. Notice its 
parameter is finite, see (|31|) . Therefore, there are only finitely many clans born before a and 
alive at time 0. In particular, this implies that there is one unique oldest clan alive at time 
0. We denote by —A the birth time of this unique oldest clan at time 0: 

A = - w£{ti < 0; Yi u >0,i£ 1}. 

We set Z° the population size of this clan at time 0: 



Z' 



o ._ 



t, - 



if A 



-U, 



The time A is also the time to the most recent common ancestor (TMRCA) of the population 
at time 0. The size of all the clans alive at time with birth time in (—A, 0) is given by 

Z 1 := Z - Z°. 

We are also interested in the size of the population just before the most recent common 
ancestor (MRCA): 



Z A := Z 



{-A)- 



Z^^-A-t^MtiK-A}- 



Theorem 4.1. The joint distribution of (A, Z A , Z 1 , Z°) is characterized by the following: 
for A, 7, rj > and t > 0, 



(34) E 



t poo 

ds ^'(w(7, s)) — / ds ip' (u(X + c(t) , s)) 
o Jo 



- e - XZ A_ yZl _ vZ Aedt 

= dt {i>'{c{t)) - $(u(r], t))j x exp 

Proof. Given / a non-negative Borel measurable function defined on M, we have 
E \ e -^ A -^-vZ° 

E]Texp(-A £ 7 E Y -U~vYl t 



f(-tj) 1 



POO 

/ dtfi(e-^ Yt ;Y t >0)/(t)E 
J o 



exp [ -7 E Y '-U 

iEZ,ti>-t 



lim E 

K— >oo 



exp -A £ y ( 



U<-t 



y-t-U) +K1 {y1 h >0} 



where we used that Poisson point measures over disjoint sets are independent. We have: 
H {e~^;Y t > 0) = M (l {yt>0} - (l - e"^)) = ^W*)) " 
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Using Lemma 13.11 we get: 
E 

We also have: 



exp ( -7 ^2 Y - 

i£Z,ti>—t 



i 

-U 



cxp l — J ds $'(u(7, s)) 



lim E 

K— >oo 



cxp 



iex,ti<—t 



Y (-t-u) + K1 { Ylh>0 y 



exp (- J ds 1 {S>0} n(l-e XYs l {Ys+t =o})^j 
exp (- J ds 1 {S>0} M (l - e~ XY ' P Ys (Y t = 0))) 
exp (-Jds l {s>0} ^(l-e-( A+c ^ y ; 
ds $(u(\ + c(t),s))) , 



= exp 

where we used exponential formulas for Poisson point measure in the first equality and the 
Markov property of Y for the second equality. Putting things together, we then get (|34p. □ 

It is then easy to derive the distribution of the TMRCA A. 

Corollary 4.2. The distribution function of A is given by 

¥(A <t)= E[e- C ® z ] = exp (- J ds i>'(c(s)) \ , 

and A has density, fj±, with respect to the Lebesgue measure given by: 

(35) f A (t) = ${c(t)) exp (- J™ ds fi(c(8))\ l {t>0} = e~ at K*al {t>0} . 

Proof. This is a direct consequence of Theorem 14 . 1 1 and (|10p . Use Lemma [3.6l to get (|35|) . □ 

The next result is a direct consequence of Theorem 14.11 

Corollary 4.3. Conditionally on A, the three random variables Z 1 , Z A and Z° are indepen- 
dent. 

We can also give the mean of the population size just before the most recent common 
ancestor (MRCA) (to be compared to the mean size of the current population given by (|23p ). 



Corollary 4.4. Let t > 0. We have 



(36) 



E 



e- xzA \A = t 



E [e 



-(X+c(t))Z] 



and E[Z A \A = t] 



${c{t)) 



Proof. This is a direct consequence of Theorem 14.11 and of (|22p . 



□ 



We deduce from (j36"P that the distribution of Z A conditionally on {A = t} converges, as t 
goes to infinity, to the distribution of Z. 
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As another application of Theorem l4.lt we get that the population just before the MRCA, 
Z A , is stochastically smaller than the current population, Z. Note that strong inequality, 
namely inequality in the almost-surely sense, does not hold in general (see Section [7]). 

Proposition 4.5. We have P(Z A < z\A = t) > F(Z < z) for all z > and t > 0. Hence, 
the population size Z A is stochastically smaller than Z : ¥(Z A < z) > ¥(Z < z) for all z > 0. 
In particular, we have 

E[Z A \A] < E[Z] a.s. 



Proof. The first equality of (|36|) implies that for any non-negative measurable function F 
defined on M, 

nnz A )\A = t]= Lj_j__ . 

Note that e~ c ^ z — E [e _c (*) z ] is non-negative for Z less than log (E [e" c ^ z ] ) and non- 
positive otherwise, and that lim*-** E[e~ c ^ z ; Z < z] - E[e" c W z ]P(Z < z) = 0. We deduce 
that: 

P Z A < z A = t) = — ^ — F - J > P(Z < z). 

7 E [e- c W z ] - v - ; 

For the last assertion, recall that for any non-negative random variable, we have E[X] = 
J °° F(X > x)dx. □ 

Remark 4.6. Instead of considering Z A , the size of the population just before the MRCA, we 
could consider the size of the population at the MRCA, Z A , which is formally given by 

z A = z A +Y J n\u=-A Y 

Notice we don't take into account the contribution of i E J2 as for those indices we have 
Yq = 0. (In particular if 7r = 0, then Z is continuous and Z A = Z A .) Similar computations 
as those in the proof of Theorem 14.11 yield: for X,t > 

, „ ,1 mr „-A^ , „ _ +1 V>'(A + c(t)) - ^'(A) 



E[e _AZ + \A = t] =E[e _AZ \A = t]- 



If ip"(0) = +00, then we get that lim E[e" AZ + | A = t] = 0. Thus, conditionally on {A = t}, 

t— >+oo 

for t large, we have that Z A is likely to be very large. (Intuitively, a clan is born at time —t 
which has survive up to time 0; and if t is large, it is very likely to have a large initial size.) 
Therefore, Z A is not stochastically smaller than Z in the general case. 



We may also consider the TMRCA of the immortal individual and individuals taken in- 
dependently and uniformly among the current population living at time t. Let J" C X be 
the indices of the clans of the randomly chosen n individuals alive at time t. (One indi- 
vidual chosen at random in the population at time t belongs to the clan, i with probability 
Yj_ t ./Z t .) Notice that Card (Jf 1 ) < n. The TMRCA for the n individuals alive at time t and 
the immortal individual is given by: 

A™ :=-m£{U;i€ J?,ieX}. 

Because of the stationarity, we shall focus on t = and write A n for Af. The joint law of Z 
and A n can be characterized by the following result. 
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Theorem 4.7. For any n > 1 and any A,T > 0, we have 



E 



Z n e- xz h 



l {A n <T} 

Proof. By definition, we have: 
E 



e~ ac l ^n*a 8 n (il>{u{\ + n,T)) 

^(u(A,T)) ( j 



d n 77 \ ip(X + rj) 



77=0 



Zn — AZ i 
e L{A n <T} 



E 



E 



rln n 



z 



*n "*in TT -, -AZ 

k=l 



n ^ 

fc=i 

y AA 3 (ds,dy) F_ S 1 { _ S < T} ) exp (-X j N z {ds,dY) YL 



(_l)n " E 
v 1 d n rj 



exp(- J N z {ds,dY) (77y_ s l { _ s < T} + AY_ S ) 



<9 n 

-If— exp 
a n r? 



ij)'(u(X, s)) — / ds ijj' (u(X + n, s)) 



r)=0 



r)=0 



where A/3 in the second equality is defined by ()13p . The result then follows from ()20p and 
(1271). □ 



Remark 4.8. Following almost the same lines as the proof of Theorem l4.7| one can characterize 
explicitly the joint distribution of { (Z rj , A r J) ; 1 < j < mj for any m, n\, ■ ■ ■ , n m 6 N* and 
—00 < ri < r2 < • • • < r m < 00. 

5. Number of old families 

We now consider the number families in the oldest clan alive at time 0. This correspond 
to the number of individuals involved in the last coalescent event of the genealogical tree. To 
this end, we take the representation (|12p for Z . 



Definition 5.1. The number of oldest families alive at time (excluding the immortal par- 
ticle) is defined by: 



(37) 



N J 



E* 



{A=-t s , Yi t >0} 



{A=-tj,0>-*j} 



We have N A > 1. In the particular case tt = and /3 > 0, we have J = J2 and N A = 1. 

The following proposition give the joint law of A, N A and Z. 
Proposition 5.2. We have for a G [0, 1], A > 0, t > 0, 



E 



a N \- xz \A = t 



ip'jcjt)) - - q)c(t) + cm(A, t)) e _ #(«(A,r)) dr 



and 



E 



V/(c(*)) -^((l-a)c(t)) _ ^((l-o)c(t) 
#(c(*)) $'(c(t) X 
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Proof. Recall notations from Section 13.11 For i £ X, we set J* = J\ \ if i e / and J* = {i} if 
i £ Ji- Given any non-negative function /, we have, using (|12p and f)16|) : 



E 



E 



f(-ti) l{yi ^ 0} 1 r 1 



ds /(s) E 



e - A Efegi y -t fc 1 {t k >-s} 



^2 1 {t k <~s, Y s k >0} 



fc/<*« {Y k . >0} 

7. 





x 2/3N 



ae Aysl {n>o} 



+ / in{di)E l 

(0,+oo) 



jEjeJ 3 1 {v i '>o} e - A E je .j 3 Yi x 



{E jeJ3 ^>o} 



where SjeJ3 &Yi(AY) is under E^ a Poisson point measure with intensity £N[dy]. We have 



E 



~ A EfcGZ ^-ti. 1 {*fc>-s} 



E 1 



-/ s dr^'(n(A,r))-/ s °°dr^( C (r)) 



We also have 



N 



e~ XY ° 1 



{n>o} 



N[Y S > 0] - N[l - e" An ] = c(s) - u(A, s). 



and 



jEjeJg 1 {vi>0} e - A E je j 3 V? 1 



{E i6Ja ^'>°> 
E j6 ./ 3 V/>o> e A E, /, 



exp (-£N[(1 - ae- Ay ")l {ys>0} ]) - exp (-£N[Y S > 0]) 

exp (-£N[y s > 0] + ^aN[e- Ays ]l { y s>0} ]) - exp (-£N[Y S > 
exp (—£((1 — a)c(s) — au(X, s))j — exp (— ^c(s)) . 



Thus, we get: 
2/3N 



ae XYs l{r 3>0 } 



(0,+oo) 



a Ej6J 3 1 { y/>0} e - A E je j 3 x 



{E jeJ3 «>o}_ 
V>'(c(s)) - - a)c(s) + o«(A, s)). 



Putting things together, we obtain: 



E 



a NA e~ xz f(A) 



dsf(s) e-J'o*^(«(V))-/. 00 *^( c W) [^'(c( fl ))-^'((l-o)c(a) + a«(A,«))] . 
Then, use for the density of A to get the result. □ 
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Corollary 5.3. We have: 



(:?*) F(N A = n\A = t) = (-l) w+1 < t ) n ^ +1) « t )) ner r. 

Suppose that ip"(0+) < oo (that is K[Z] < +00/ Then, we have 



E[N A \A = t]=tf'(0) ir 
Furthermore the function 1 1 — ► EfA^A = t] is non-increasing. 

Proof. The first two assertions are straightforward consequences of Proposition 15.21 To get 
the monotonicity of 1 1 — > E[A^|^4 = t], we simply notice that both 1 1 — > c(t) and 







are non-increasing. □ 
Remark 5.4. Suppose that ip"(Q+) < 00. We deduce from (f38j) that 



lim F(N A = 1\A = t) = 1. 

t— >+oo 

TA 



Thus, the distribution of N conditionally on {A = t} converges as t goes to infinity to 1. 
So roughly speaking N A is likely to be equal to 1 if the TMRCA (or age of the oldest clan 
alive) is large. Notice that if ijj"{0+) = +00, this result may be false (see the next Remark). 

Remark 5.5. Let us consider the stable cases, ip(X) = a\ + coA 1+Q °, with co > and qo £ 
(0, 1]. We deduce from Corollary 15.31 that 

E[a NA \A = t] = l-(l-a) ao . 

In particular N A is independent of A. The case ocq = 1 correspond to the quadratic branching 
mechanism and we get that a.s. N A = 1. For a G (0,1), we deduce from (I38p that: for 

n G N* 



71-1 

a J 



V(N A = n\A = t) = ±-a T](k 
n\ - LJ - 

k=l 

For «o G (0, 1), we have ip"(0+) = +00 and the result of Remark 15.41 does not hold. 



6. ASYMPTOTICS FOR THE NUMBER OF ANCESTORS 



The number iV_ a q defined by (|30j) of clans born before time — s and alive at time is non- 
decreasing and is distributed as a Poisson random variable with parameter A(s) given by (|3ip . 
As A(s) goes to infinity as s goes down to 0, we deduce that A_ Sj o tends to infinity almost 
surely as s 1 0+. A natural question is then how fast the numbers A_ Si o tend to infinity. It 
follows from the definition of the Poisson random measure A/3 in (|13p that {A^a-i/^q; s > 0} 
is Poisson process with parameter 1, and by the strong law of large numbers for Levy processes 
(see [5J), we deduce that 

lim — 7^ = 1 almost surely. 
40+ A(s) 

One can also ask how fast the number M s of ancestors at time — s of the current population 
living at time tends to infinity. To answer this question, we need to introduce the genealogy 
of the families. Notice the genealogy of a CB is a richer structure than the CB itself. 
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6.1. Genealogy of CB. We recall here the construction of the Levy continuum random tree 
(CRT) introduced in [34|, [33] and developed later in [16] for critical or sub-critical branching 
mechanism. The results of this section are mainly extracted from [16] . except for the next 
subsection which is extracted from 



6.1.1. Real trees and their coding by a continuous function. Let us first recall the definition 
of real trees. 

Definition 6.1. A metric space (T,d) is a real tree if the following two properties hold for 
every V\,v 2 £ T ■ 

• (Unique geodesic.) There is a unique isometric map f Vl>V2 from [0, d(vi,v 2 )] into T 
such that 

/m, 1)2(0) = v i an d fv 1 ,v 2 {d(vi,v 2 )) = v 2 . 

• (No loop.) If q is a continuous injective map from [0,1] into T such that q(0) = v\ 
and q(l) = v 2 , we have 

q([0,l])=f VuV2 ([0,d(v 1 ,v 2 )}). 
A rooted real tree is a real tree (T, d) with a distinguished vertex v@ called the root. 

Let (T, d) be a rooted real tree. The range of the mapping f Vl)V2 is denoted by [^1,^2] 
(this is the line between v\ and v 2 in the tree). In particular, for every vertex v £ T, [«0, vj is 
the path going from the root to v which we call the ancestral line of vertex v. More generally, 
we say that a vertex v is an ancestor of a vertex v' if v 6 [f0,v']. If (vk £ K) is a set of 
vertex of T, there is a unique a £ T such that [v0,a] = DfeeK 1^0)^1- We- call a the most 
recent common ancestor of (v)~ £ K). A leaf is a vertex which is the ancestor of itself only. 
We say that d(0, v) is the level (or generation) of the vertex v. 

We now recall the coding of a compact real tree by a continuous function g : [0, +00) — > 
[0, +00) with compact support and such that g(0) = 0. We also assume that g is not 
identically 0. For every < s < t, we set 

m g (s,t) = inf g(u) and d g (s,t) = g(s) + g[t) — 2m g (s,t). 

u£[s,t] 

We then introduce the equivalence relation s ~ t if and only if d g (s,t) = 0. Let T g be the 
quotient space [0, +00)/ ~. It is easy to check that d g induces a distance on T g . Moreover, 
(T g , d g ) is a compact real tree (see [T7], Theorem 2.1). We say that g is the height process of 
the tree T g . 

For instance, when g is a normalized Brownian excursion, the associated real tree is Aldous' 

CRT pg. 

6.1.2. The underlying Levy process. We present now how to define a height process that codes 
a random real trees describing the genealogy of a CB using a Levy process with Laplace 
exponent given by the branching mechanism tp. We shall consider only the case of the sub- 
critical branching mechanism ifj given by ([I]). 

Let X = (Xt,t > 0) be a R- valued Levy process with no negative jumps, starting from 
and with Laplace exponent ip under the probability measure P (and E the corresponding 

expectation): for A > 0, E e~ XXt = e #(A) . Since we assume that (3 > or vr((0, 1)) = +00, 

we get that a.s. X is of infinite variation. 

We introduce some processes related to X. Let I = (It,t > 0) be the infimum process of 
X, I t = info<s<t X s , and let S = (St,t > 0) be the supremum process, St = sup 0<s<t X s . We 
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will also consider for every < s < t the infimum of X over [s,t]: 



If = inf X r . 

s<r<t 



The point is regular for the Markov process X — I, and — / is the local time of X — I at 
(see [8], chap. VII). Let N be the associated excursion measure of the process X — I away 
from 0. Let a = inf{t > 0; Xt — h = 0} be the length of the excursion of X — I under N. We 
have Xq = Iq = N-a.e. 

Since X is of infinite variation, is also regular for the Markov process S — X. The local 
time, L = (Lt,t > 0), of S — X at will be normalized so that 

E[e~ A V]= e -^ A )/\ 
where L^ 1 = inf{s > 0; L s > t} (see also [5J Theorem VII.4 (ii)). 

6.1.3. The height process and the Levy CRT. For each t > 0, we consider the reversed process 
at time t, X® = (X S W ,0 < s < t) by: 

X® =X t -X (t _ s) _ if 0<s<t, 

and Xf = Xf. The two processes {X& , < s < t) and (X s ,0 < s < t) have the same 
law. Let be the supremum process of 1« and be the local time at of — X« 
with the same normalization as L. As assumption (1A1|) is in force, there exists a continuous 
modification H = (Ht,t > 0) of the process {lft\t > 0), see Theorem 1.4.3 in [16J. The 
process H is the so-called height-process and (Th, dn) is the corresponding Levy tree. Notice 
that N-a.e. we have Ht = for t> a. 

6.1.4. Local time for the height process and CB. We now check that Th represents the ge- 
nealogy of a CB with branching mechanism tjj. 

The local time of the height process is defined through the next result, see [16], Lemma 
1.3.2 and Proposition 1.3.3. 

Proposition 6.2. There exists a jointly measurable process (L^,a > 0, s > 0) which is 
continuous and non- decreasing in the variable s such that: 



• For every t > 0, lim sup E 

£->o a>0 



For every t > 0, lim sup E 

a>e 



sup 


e- 1 


L 


_s<t 






sup 


e- 1 




_s<t 







l{a<H r <a+e} dr - L® 



0. 



/ l{a-e<H r <a} dr - L% 
JO 

• P-o.s.j for every t > 0, L° = —I t . 

• The occupation time formula holds: for any non-negative measurable function g on 

M+ and any s > 0, / g{H r ) dr = I g(a)L1 da. 

Jo J(0,+oo) 

Let T x = inf{t > 0;^ < — x}. We have the following Ray-Knight theorem which explains 
why the Levy CRT can be viewed as the genealogical tree of a CB. 

Proposition 6.3 (|16j. Theorem 1.4.1). The process {L9p , a > 0) is distributed under P as 
Y under P x (i.e. is a CB with branching mechanism ip starting at x). 

We then get the following Corollary. 

Corollary 6.4. The process L(H) = (L%,a > 0) is distributed under the excursion measure 
N as Y under its excursion measure N. 

Informally, counts the number of vertices (in fact leaves) of Th at level a under N. 
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6.1.5. Poissonian representation of the height process above a level. Let a > be fixed. We 
consider the excursions of the height process H above a under the excursion measure N. 
Precisely, let (uk,Vk), k £ /C be the excursions of H above a over the time interval [0, a]. We 
set H k = (H Uk+sAVk - a, s > 0). 

The next result is a consequence of Proposition 4.2.3 in [16J. 

Proposition 6.5. Conditionally on (L r a ,r < a), the measure ^2kefC^H k (dH) is a Poisson 
point measure with intensity L%N[dH}. 

We give a definition for the number of ancestors, which will be used in the next section. 

Definition 6.6. The number of ancestors at time a of the population (coded by H) living at 
time b is the number of excursions of H above level a which reach level b > a: 

Ra,b(H) = ^2 l{C fe >&-a}) 

where £fc = max{if^, s > 0}. 

6.2. Genealogy of Z. Recall notations from Section 16.1.51 In order to simplify notations, 
we shall write N for N. 

We use formulation (fT2|) to construct the genealogy of Z. Recall notation A/q from Sec- 
tion O 

• Conditionally on A/"o, let J\Fi(dt,dH) = YljeJi &t ,Hi(dt,dH) be a Poisson point mea- 
sure with intensity v{dt) N[dff] with v(dt) = Ylii&i r i^u{dt). 

• Let J\f2{dt,dH) = Ylij^j 2 ^t :j ,Hi{^->^) be a Poisson point measure independent of 
(A/o,A/i) and with intensity 2/3 dt N[dH]. 

We will write Y 3 for L(H 3 ) for j G J = J\ \} J?,. Thus notation (|12|) is still consistent with 
the previous Sections, thanks to Corollary 16.41 And the process YljeJ ^tj,m allows to code 
for the genealogy of the families of Z. 

Let s > 0. Following Definition 16.61 we consider M s the number of ancestors at time — s 
of the current population living at time 0, not including the immortal individual: 

•u- > /. /.;//'). 

6.3. Asymptotics for the number of ancestors. We first give a technical Lemma, which 
proof is postponed to the end of this Section. 

Lemma 6.7. The joint distribution of M s and Zq is characterized by the following equation: 
for i], A > s > 0, 

' e -Z- s [(l-e-V) c (s)+e-*u(\,s)] _ 

In particular, M s has the same distribution as V%_ s , where V s is a Poisson process with 
parameter c{s) independent of (Zt,t G M). 

Remark 6.8. Note that one can replace Z_ s by Zq for the right hand side of (|39p thanks 
to stationarity. The effect of our presentation is to emphasize the branching property: con- 
ditionally on the number of families with lifetime larger than s is a Poisson random 
variable with parameter the product of population size Z_ s and the rate c(s) = N(C > s) 
that one family has lifetime lager than s. 



(39) 



E 



■Jo dr '/''O(V)) je 
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The next result is the analogue of the result on the number of ancestors for coalescent 
process given in [6] and [37]. 



Theorem 6.9. The following convergence holds in probability: 

M, 



lim 



>o c(s) 



Z . 



Proof. Let p > 0. We take r\ = p/c(s). We deduce from (i39j) that: 



limE 

s->0 



E 



-Z (p+\) 



This implies that 



M, 



c\s 



, Zq I converges in distribution to (Zq, Zq), which gives the result. □ 



Remark 6.10. Suppose in addition that x 2 ir(dx) < oo. Set Tr(dx) = x 2 ir(dx). Then the 
7f-coalescent defined in [B] comes down from infinity by the assumption (|A2p (see [B] and 
the references therein). It was shown in [6j that the speed of coming down from infinity 
satisfies 



(40) 



lim — — = 1 almost surely. 
40+ c(t) 



Prom the heuristic duality between coalescence and branching processes, our result in Theo- 
rem [6j9] can be seen as a duality to (|40p. 



Proof of lemmaWTA For any 77, A > 0, we have: 



E 



-r]M s -XZ 



=E 



- x T. je j 1 {-«<t 1 -<o} y -t„. 



E 



exp 



- V M S -Xj^ 1 



{tjK-syY-ti 



■ exp 



dr tj}'(u(X, r)) 



E 



(41) 



■ exp 



<<x-i» I - E htK-s} [vR-s-ti,-^) + \YL U 

dr tj}'(u(X, r)) 



exp 



da ft (N[l - eM-vRa, a+ s(H) - XY a+s )]) , 



where we used that Poisson random measures over disjoint sets are independent in the first 
equality, Lemma 13.11 in the second equality and a immediate generalization of Lemma 13.11 to 
genealogies in the third equality. 

Using notations from Section 16.1.51 on the Poissonian representation of the height process 
above level a from Proposition 16.51 we get 

"l _ e -E feeK ^ {Cfc > a }+^(^ fc ), 



N 



1 



-r)Ra, a + a (H)-\Ya-l 



N 
N 



1 



-y a N[l-exp(-r,l {c > s} -AY s )] 



As 1 - exp (-r?l{c>s} - XY S ) = (1 - e ?? )l{c>s} + e ''(l - e s ), we deduce that 



N 



-rjR a ^ a+s (H)-XY a+s 



N 



-X'Y a 



U 



(X',a) , 
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with A' = (1 — e~ r, )c(s) + e~ v u(X, s). Then we use (|3.6p to write 

exp (- da il (N[l - exp(-7 ? i? a , a+s (i/) + AF a+s )])J = exp ^- da ^ («(A',o))^ 

= E e~ X ' Z ~ 

Plugging this in (|4"T|) . we get ([39]) . □ 

7. The quadratic branching mechanism 

Let (efc; fc G N) be independent exponential random variables with mean 1. 

7.1. Preliminaries. In this Section we give some explicit distributions and more precise 
results for the case of quadratic branching mechanism: 

(42) V(A) = /3A 2 + 2/30A, 

where j3 > and > 0. We have 

, . 20A ,.20 

n(V)= (20 + A)e 2 ^-A ' C ^ = e^l' ^* = 

For every t G R, it follows from Corollary 13.31 that the process {Z s+ t,s > 0} has the same 
distribution as the strong solution of the following stochastic differential equation 



dX s = ^2fiX s dW s + 2/3(1 - 6X s )ds, 

with initial law W'(Zq G •), where W is a standard Brownian motion (see |44j Section XI. 3 for 
the existence of strong solution). 

7.2. Joint law of the TMRCA and populations sizes. We have the following represen- 
tations. 

Theorem 7.1. Assume ij) is given by O^D . 
(i) We have for A > 0: 

-AZi ( 26 \ 2 „ (d ) 1 



(43) E[ e— ] = ^— j ^ z = ^ (ei+e2) - 

(mJ We Ziaue /or f > 0: 



(44) F(A < t) = (1 - e~ 2m ) 2 and A = max(ei, e 2 ). 

20p 

(mj Conditionally on {A = t}, we have the following distribution representation: 

(45) {Z\Z\ZP)® ( 61+62 63 + 64 65 



20 + c(t) ' 20 + c(t) ' 20 + c(i) 



Proof. By Lemma [T9l we have 



29 + X / 

This gives (i). Using Theorem 14. 11 we obtain: 

Ek -^-^-n Z o. A e . = 2/3(20)^(^-1) 

L '^ tut J [( 2 + 7? )e 2 ^-r ? ][(20 + 7 )e 2e / 3i -7] 2 [(20 + A)e 2 ^-A] 2 

We then deduce (ii) and (hi). □ 
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We then are able to compare more precisely the size of the current population Z = Z 1 + Z° 
with the size of the population Z A just before the birth time of the MRCA. As (Zt,t € M) 
is continuous, notice that that Z is also the size of the population at the birth time of the 
MRCA. Recall that Z A is stochastically smaller than Z. The next Corollary indicates that 
Z A is however not a.s. smaller than Z. 



Corollary 7.2. Assume ip is given by We have: 



a.s. 



as well as 



Proof. We have 



P(Z A < Z\A) = ^ and E[Z A \A] = \^[Z\A] 
¥{Z A <Z) = ^- and E[Z A ] = -M[Z]. 



¥{Z A < Z\A) = P(ei + e 2 < e 3 + e 4 + e 5 ) = — . 



The other equalities are obvious. □ 

There is also an interesting result (which is not valid for general branching mechanism) 
which can be interpreted by time reversal. Recall £ is the extinction time of Y. 

Proposition 7.3. Assume ip is given by ^42fy . Conditionally on Z , A is distributed as Q 
under P^: for all t > 

(46) F(A>t\Z) =e~ c{t)z = P Z (( <t). 

Proof. We deduce from (j33"|) and ((331) that the densities of Z and A are: 

(47) f A (t) = 46f3e- 2 ^ t (l-e- 2 ^ t )l {t>0} and f z (z) = (28) 2 z e~ 26z l {z>0} . 
We also deduce from (|45p the density of Z conditionally on A = t: 

fz\A=t(z) = (29 + c(t)) 3 z 2 e-( 2e+c ^ z 1 {Z>0} . 
Using Bayes' rule, we get the density of A conditionally on Z = z: for z, t > 

We obtain P(A <t\Z) = e~ c ^ z . Then, we conclude as 

Pr(C<0 = e" rN[C -* ] =e- rc W, 
where we used the Poissonian representation of Y given by ((7|). □ 
Notice that (|46p implies that 

F(c(A)Z > c(t)Z\Z) = P(A < t\Z) = e~ c{t)z . 

We obtain that c(A)Z is independent of Z and c(A)Z = e%. We thus deduce the following 
Corollary. 

Corollary 7.4. Assume ip is given by |^^| ). We have the following representation: 

Z, c(A), Z ) = ofl , 26> — e 4 + e 5 

\ 26> ei + e 2 2& ei + e 2 + e 3 

Remark 7.5. It is also easy to check that conditionally on {Z = z}, A is distributed as 
— — log ( H J . In particular, we deduce that ^4 is distributed as — — log ( 1 + ' 



2/36* ° V e 3 7 ' 2/36» ° V e 3 
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7.3. TMRCA for n individuals. Next, we consider the joint distribution of Z and A n the 
TMRCA of the immortal individual and n individuals chosen at random among the current 
population. The next result is a direct application of Theorem 14.71 

Proposition 7.6. Assume ip is given by \4%fy - We set s = 1 — e~ 2 @ st . We have for n £ N*: 
E 



(n + l)!s n / 20 



ryn „ — XZ -i 

z ( ^mij {29 + Xs)n y 2 e + x / 

and the size-biased distribution of A n is the maximum of n independent exponential random 
variables with mean 1: 

E [Z n l {A n em} ] =E[Z"](l-e- 2 ^r. 

We can compute explicitly the distribution of A . See also [3D], section 3, for similar 
computations in a slightly different setting. 

Proposition 7.7. Assume tp is given by \4>ty - We set s = 1 — e~ 2l39t . We have: 

(48) PM 1 < t) = 2—^- ( 1 + — ^log(s) ) and F(c(A 1 )Z>x\Z) = ---^(l-e- x ). 

1 — s \ 1 — S J XX 1 

In particular c(A 1 )Z is independent of Z. 

Notice that ¥(A < t) = s 2 so that we recover from ()48j) the trivial inequality ^(A 1 < t) > 
P(A < t) as A > A 1 . 

Proof. Applying Theorem 14.71 we get 

/»oo 

E[e~ AZ l {A i< t} ] = J dr, E[Z e~" z l {A i< t} ] 

(49) =2(e 2 ^-l) 2 f 7 ^,^ T -logfl + ' 20 



( e 2e/3t -l) 28 + \ b V (e 29/3 * -1) 26 + X 
In particular, the distribution of A 1 is given by 

P(A* <t) = 2(e 2 ^ -I) 2 (t^T) " lo * i 1 + piij) ) • 

Applying inverse Laplace transforms to (|49p and using the density of Z given in (|47p , we get 
that the conditional law of A 1 given Z: 

2(e 2< ""-l) 2 / 20 8 -»z/(^*"-l)_l\ 

' ( ^ l|z| = i wrH + z )• 

which implies that 

r-2 



(20Z/(e 2 ^ Al -1) > s) = - - 1(1 - e"^ 



□ 



7.4. Fluctuations for the renormalized number of ancestors. Finally, we complete 
Theorem 16.91 by giving the fluctuations for the renormalized number of ancestors. 

Theorem 7.8. Assume ip is given by (L/j?| ). We have 

V4mz](^-z)^U(z-z% 

\c(s) J 40+ 
where Z' is distributed as Z and independent of Z . 
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Proof. We first note that for every A > 0, 

$ (u(A v / c(s),r)) dr < sip' (Ay 7 ^) — 0, 



and 



lim(l - e A/ V^)c(s) + e A/ V^ u(X^/4sj, s) = -X 2 /2. 

s— >0 



Under the current assumption on the exponent ift, E[e A ^] < 00 and E[e AMa ] < 00 for A > 
small enough. Hence, by an analytic continuation argument, we see that f|39|) implies that 
for A > small enough, the following holds for all small s: 
(50) 



E 



- Jo $'("( A \A(s)> r )) dr £ 



Hence, for all small A > 0, we have 



limE 



E 



20 



29- A 2 /2 

since E[Z] = 1/0. The result is then a consequence of [40] . 



E 



□ 
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